Learning Semantic Visual Concepts from Video
نویسندگان
چکیده
Increasing amounts of digital video data have become available with the rapid growth in video technology. As a result, there is a great need for automatic extraction of concepts or events of interest from video. In this paper, we present an approach for learning concepts from video. The approach consists of three steps. In the first step, video shot boundaries are detected, and from these shots key frames are extracted, which are representatives of the shots. In the second step, key frames are segmented and a variety of features are computed. In the third step, a classification by feature partitioning method is employed for learning different semantic concepts. The results are presented for successfully learning semantic concepts such as ocean, mountain, people, and building from a variety of digital videos.
منابع مشابه
Recognition of Visual Events using Spatio-Temporal Information of the Video Signal
Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...
متن کاملThe Effect of Using Visual Aids, Semantic Elaboration, and Visual Aids plus Semantic Elaboration on Iranian Learners' Vocabulary Learning
This study investigated the effect of using visual aids, semantic elaboration, and visual aids plus semantic elaboration on the Iranian EFL learners' vocabulary learning. To conduct the study, the researchers assigned 49 elementary learners to three homogeneous groups according to their proficiency level. Then, a pre-test of Paribakht and Wesche's Vocabulary Knowledge Scale was given to each gr...
متن کاملMultimodal Video Concept Detection via Bag of Auditory Words and Multiple Kernel Learning
State-of-the-art systems for video concept detection mainly rely on visual features. Some previous approaches have also included audio features, either using low-level features such as mel-frequency cepstral coefficients (MFCC) or exploiting the detection of specific audio concepts. In this paper, we investigate a bag of auditory words (BoAW) approach that models MFCC features in an auditory vo...
متن کاملSemantic Video Retrieval Using High Level Context
Video retrieval – searching and retrieving videos relevant to a user defined query – is one of the most popular topics in both real life applications and multimedia research. This thesis employs concepts from Natural Language Understanding in solving the video retrieval problem. Our main contribution is the utilization of the semantic word similarity measures for video retrieval through the tra...
متن کاملAssessing Semantic Relevance by Using Audiovisual Cues
This paper presents two complementary approaches for assessing semantic relevance in video retrieval—(1) adaptive video indexing and (2) elemental concept indexing. Both approaches make extensive use of audiovisual cues. In the former, retrieval is performed by using implicit semantic indices through audio and visual features. Audio features are extracted by statistical time-frequency analysis ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002